A Robust Three-Stage Hybrid Framework for English to Bangla Transliteration
نویسندگان
چکیده
Phonetic typing using the English alphabet has become widely popular nowadays for social media and chat services. As a result, text containing various Bangla words phrases increasingly common. Existing transliteration tools display poor performance such texts. This paper proposes robust Three-stage Hybrid Transliteration (THT) framework that can transliterate both phonetic typed satisfactorily. is achieved by adopting hybrid approach of dictionary-based rule-based techniques. Experimental results confirm superiority THT as it significantly outperforms benchmark tool.
منابع مشابه
A Comprehensive Roman (english)-to-bangla Transliteration Scheme
A transliteration scheme from Roman (English) to Bangla can help increase the use of Bangla in essential and diverse computing areas such as word processing, Internet and mobile communication and information query and retrieval. The Bangla script’s irregular phonetic nature and its large repertoire of consonant clusters (juktakkhors) create a large gap between the pronunciation and the orthogra...
متن کاملHow to Translate Unknown Words for English to Bangla Machine Translation Using Transliteration
Due to small available English-Bangla parallel corpus, Example-Based Machine Translation (EBMT) system has high probability of handling unknown words. To improve translation quality for Bangla language, we propose a novel approach for EBMT using WordNet and International-Phonetic-Alphabet(IPA)-based transliteration. Proposed system first tries to find semantically related English words from Wor...
متن کاملA Hybrid Approach to English-Korean Name Transliteration
This paper presents a hybrid approach to English-Korean name transliteration. The base system is built on MOSES with enabled factored translation features. We expand the base system by combining with various transliteration methods including a Web-based n-best re-ranking, a dictionary-based method, and a rule-based method. Our standard run and best nonstandard run achieve 45.1 and 78.5, respect...
متن کاملEnglish to Persian Transliteration
Persian is an Indo-European language written using Arabic script, and is an official language of Iran, Afghanistan, and Tajikistan. Transliteration of Persian to English—that is, the character-bycharacter mapping of a Persian word that is not readily available in a bilingual dictionary—is an unstudied problem. In this paper we make three novel contributions. First, we present performance compar...
متن کاملA Hybrid Approach of English- Hindi Named-entity Transliteration
In recent years, machine transliteration has gained a center of attention for research. Both machine translation and transliteration are important for e-governance and web based online multilingual applications. As machine translation translate source language to target language which results in wrong translation for named entities. Named entities are required to be translated with preserving t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International journal on natural language computing
سال: 2022
ISSN: ['2278-1307', '2319-4111']
DOI: https://doi.org/10.5121/ijnlc.2022.11104